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Abstract 

Consider a Markov process uj t at stationarity and some event C (a subset of the state- 
space of the process). A natural measure of correlations in the process is the pairwise 
correlation P^o^t G C] — P[loq 6C] 2 . A second natural measure is the probability 
of the continual occurrence event {w s G C, Vs G [0,t]}. We show that for reversible 
Markov chains, and any event C, pairwise decorrelation of the event C implies a decay 
of the probability of the continual occurrence event {u) s G C Vs G [0, £]} as t — >• oo. Wc 
provide examples showing that our results are often sharp. 

Our main applications are to dynamical critical percolation. Let C be the left-right 
crossing event of a large box, and let us scale time so that the expected number of 
changes to C is order 1 in unit time. We show that the continual connection event has 
superpolynomial decay. Furthermore, on the infinite lattice without any time scaling, 
the first exceptional time with an infinite cluster appears with an exponential tail. 

1 Introduction 

We study the relationship between pairwise decorrelation of a specific event and the decay 
rate of the probability of continual occurrence of the event in reversible Markov pro- 
cesses. In particular, Theorem 1 1 . 1 1 b elow states that any decay of the pairwise correlations 

for the process (ut)t>o in stationarity implies a comparable decay of the joint probability 

P[co s e A for all < s < t] . 

Given a Markov process u>t on S with stationary probability measure ir, its time 1 Markov 
operator (Tif)(ui) := E[/(wi) | ujq = oj] on / £ L 2 (S,ir) is a normal operator, hence it has 
a spectral decomposition, with Spec(Ti) C {z : \z\ < 1} C C and z = 1 being an obvious 



1 



eigenvalue. Its spectral gap is denned as g := inf {|1 — A| : A e Spec(Ti) \ {1}}, while its 
absolute spectral gap is g* := 1 — sup {|A| : A e Spec(Ti) \ {1}}. It is well-known (and 
not hard to see) that g* > is equivalent to having an exponential decay of correlations for 
any function / : S — > R with E[/] = 0: 

E[/M/fo)]<(l-<70*E[/ 2 ], 

for the process at stationarity. We write P both for both the law tt on static configurations 
and for the measure of the process at stationarity. Similarly, E denotes expectation of a 
function with respect to one or other of these laws, depending on whether the function 
is defined on static or dynamic configurations It is quite classical for this case (in fact, 
g > suffices), see |AKSz87| . |AFWZ95j . |HLW06l Theorem 3.6] and [A"S00l Theorem 
9.2.7], that for any set A C S with stationary measure bounded away from 1, the exit-time 
tail P[ui s E A for all < s < t] is exponentially small in t, with an exponent depending on 
the spectral gap and on tt(A). The strongest such bound is the one in [ASOOj . with a 
generalization in [MODRSS061 Theorem 5.4]. 

Our Theorem 1 1.1 1 is a generalization of these results for the case when we have a pairwise 
correlation decay not for any function, but only for being in a given A — which might 
happen on a much faster time-scale than the mixing time of the entire chain. In other 
words, our generalization concerns the hidden Markov chain l{ Wt gA}- Furthermore, our 
Theorem 11.21 gives a generalization of [ MODR SSOo] Theorem 5.4] in a different direction, by 
showing that, assuming a spectral gap, the exit-time tail from A is exponentially decaying 
provided that the probability that ujt is in A at every moment of a fixed time interval is 
bounded away from one (which may be the case even if tt(A) is arbitrarily close to 1). 

The exponential exit-time tail for Markov chains with spectral gap (such as random 
walks on expander graphs) has many applications in computer science including derandom- 
ization of algorithms [HLW061 Section 3] and noise sensitivity |MODRSS06j , suggesting that 
our results may prove useful from such points of view. Nevertheless, our initial motivation 
comes from the study of dynamical percolation on planar lattices, which is the natural time 
evolution of critical percolation in the plane, a central model of statistical mechanics; see 
|HgPS97j IBKS991 [SStlOl IGPSlOal IGPSlObl iGPSl IHmPS12| for the original papers, and 



|Ste091 IGS12j for surveys. The implications of our results to dynamical percolation will be 
explained in Section [H 

We now state our main results in detail. 

1.1 Exiting an event with some pairwise decorrelation 

We will consider continuous or discrete time Markov processes, (uJt)teR ° r ( w t)tez ; on some 
state space S, with some (not necessarily unique) stationary probability measure it; we will 
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always consider the process run in stationarity, i.e., with ujq ~ 71". For functions / : S — > M, 
consider the usual inner product (/, g) := Ef/g], and the Markov operator (Ttf)(io) := 
E[/(wt) | ujq = to]. Let C be a static event (i.e., measurable with respect to loo), suppose 
that 7r(C) = P [loq € C] = p, and let f = Iq- The decay of correlations of / in time is often 
quantified by the function d : (0, oo) — > [0, oo) in one of the following two inequalities: 

P[oj ,oj t EC]- P[co € C] 2 = (/,TJ) - (E/) 2 < d(t) Var[/] (1.1) 

and 

Var[T t /] = (T t f, T t f) - (E/) 2 < d{2t) Var[/] , (1.2) 

for all t G [0, oo). Of course, for reversible Markov processes, (jl.ip is equivalent to ()1.2j) . We 
will consider the cases where the decay of d(t) as i — > oo is either polynomial or (stretched) 
exponential. Sometimes, one has a sequence of Markov processes (ojf)t^R, n G N, on larger 
and larger finite state spaces, with the time parameter coming from the original time of the 
process rescaled by a function of n. In this case, the bounds are understood uniformly in n. 

Theorem 1.1. In the above setting, assuming we have that 

P [„. e C V, e [0, «]] < ,ni» { (E±i) ' + ^ (f ) } , (1.3, 

and therefore 

£- a +°W t/d(t) = 6(r a ), 

exp ( - t^ +o(1) ) ifd{t) = exp(-9(t Q )) , 

as t — > oo, where the o(l) terms depend only on p, a and the constant factors implicit in 
the G(-) notation. 

Examples and questions of sharpness and of non-sharpness in Theorem 11.11 appear in 
Section [3l 

Remark. Note that we make no assumption of reversibility in Theorem 11.11 However, as 
we have noted, for a reversible Markov chain, we may replace the assumption (|1.2p in the 
statement by (11. ip . which is a more familiar form in which to express decorrelation in a 
Markov process. 

Motivation. As we mentioned above, our main motivation is dynamical critical percolation 
on planar lattices: site percolation on the triangular lattice or bond percolation on Z 2 . Let 
C be the left-right crossing event of a large box, and let us scale time such that the expected 
number of changes to C is order 1 in unit time. Theorem 11.11 implies that the continual 
connection event has superpolynomial decay. See Corollary 14.11 
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1.2 Exiting events denned on time intervals, assuming a spectral gap 

We consider a continuous time Markov process semi-group Tt = e 1 ® on some state space S, 
reversible with respect to a probability measure tt. Then the infinitesimal generator Q is 
self-adjoint (reversible) and negative semi-definite with respect to the usual inner product 
given by n, and its spectrum is contained in (—00, 0]. We will assume that Q has a spectral 
gap 5 > around the obvious eigenvalue 0; then T t has an absolute spectral gap 1 — e~ St , 
and the process is ergo die. 

Let f2 be the space of all paths cj : K — > S of the Markov process under the probability 
measure P, and let L 2 (Q,P) denote all L 2 integrable functions from f2 to M. For a subset 
ICflwe denote by Tj the sigma algebra generated by {co(t) : t & I}. 

Theorem 1.2. Suppose that the generator Q is reversible with respect to a probability 
measure tt and has spectral gap 5 > 0. Let k S N + ; and let aj, 6, G K, < i < k satisfy 
a>i < bi for such i, as well as bi < a^+i for < i < k — 1; we also permit ao = —00 as well 
as bk = 00. 

Let Aq, ■ ■ ■ ,Ak be subsets of ft, each Ai measurable with respect to T^^a. Then 

k fc-i 

p [p| Ai] < vWVpIAI II [v / pWv / p[A^I + e- 5 ^-"^ (1 - v/pWvMA+i]) 

i=0 i=0 

In particular, suppose that A is measurable with respect to J~[ a ,b] with ~P[A) = p, and [ai, bi] = 
[a + Si,b + s^. Setting ti = Sj + i — — (b — a) for all < i < k, then, provided that ti > 
for all such i, 

fe-i 

<p]J {p + e- sti (l-p)). 

i=0 

Motivation. As we will show in Corollary 14.41 this theorem implies for dynamical perco- 
lation on the infinite lattice that the probability that there is no exceptional time in [0, t] 
with the cluster of the origin being infinite is exponentially small. This corollary plays a 
significant role in |HmPS12] . 
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2 Proofs 

2.1 Exiting an event with some pairwise decorrelation 

We now prove Theorem I l.li 

Proof. Let p < A < 1. Consider the static event 

A s := jcj G S : P[uj s G C \ oj = u] < A j . 

Note that for s = we have P [Aq] = p, while for large s one expects P [Ajj] to be small. 
Let r = i/fc, for some & G Z+. We claim that, for m > 0, 

P [w G^nC; w jT G A r n C for 1 < j < m] < \( m ~V v0 P [oj G A^] . (2.1) 

We may prove this by induction on m, the cases where m G {0, 1} being trivial. For m > 2, 
writing B m for the event on the left-hand side of ()2. 1 j) . we have that P [-B m ] = P [-B m _i] g m , 
where g m is the conditional probability of w mr G A T nC given B m -\. Note that g m is at most 
P[w mr G C | B m _i]. The conditional distribution of 0J( m -i) T given -B m _i being supported 
on the event W( m _i) r G A T , it follows from the Markov property that P[w mr G C | < 
A. Hence, the inductive hypothesis at m — 1 implies this statement at m, giving (|2.ip . 
By the same argument, we see that, for each m > 0, 

P [uj jT G A T n C for < j < m] < A m . (2.2) 

We find then that 

G C Vs G [0, t] < P G C for < j < k] 

< P [uj jT G C n A T for < j < k] + 

A; 

^2 P [w £t Gi'nC; w iT e A T DC for £ < j < k] 



Co'. 



k 



<^ + £ A N-i)vo p [ %6i c 



£=0 

In the third inequality, (|2.2p was used to bound the first term on its left-hand side, while 
the summand was bounded using (|2.1[) with m = k — £ and stationarity. 
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We need to find now an upper bound on P L4£] for s large. By the definition of A s , 
E [1 AS T 8 f] = E [f( Ua ) | A*] P [A;] > A P [A;] , 
where, as before, f = Tic- On the other hand, 

E [1 A% T s f] = E [l A cp] + E [l A c (T s f - E/) 
Putting these two things together, 

e[i a =(T s /-E/)] >(X-p)P[A c s ]. 
Applying Cauchy-Schwarz to the left-hand side, 

||l A c|| 2 ||T s /-E/|| 2 >(A-p)P[^], 

so that we obtain 

Var[T s /]V2 = ||r s/ _ E /|| 2 > (X - p)P[A c s ] 1/2 
Therefore, (jl.2p implies that 



(2.4) 



uj s £C\/s£ [0, t] 



(A-p)2 

Now take s = t, where recall that r = t/k for some k G Z + that we will shortly specify. 
Plugging (g5D into ff2T3|) . we find that 

Setting A = (p+ l)/2 yields (fOD . 

For the case = t~ a , setting k = [K logij for a suitable constant K = K(X,a) 
makes both terms t~ a , as desired. For the case d{t) = exp(— t a ), we set k = [t@\, and 
optimize the upper bound by letting a(l — /3) = f3, i.e., choosing (3 = a/(l + a), and we are 
done. □ 



2.2 Exiting events denned on time intervals, assuming a spectral gap 

In this subsection we prove Theorem 11.21 There are two main ideas. The first is that 
if a chain has a spectral gap, then the associated Markov operator will be a strict L 2 - 
contraction not only on functions with zero mean, but also on any function whose support 
has a stationary measure bounded away from 1. The second idea is to use conditional 
expectation and the Markov property to extend the first idea to functions defined not 
on the state space, i.e., at individual times, but on time intervals. These two ideas are 
formalized in the following lemma. 



6 



Lemma 2.1. Suppose that the infinitesimal generator Q is reversible with respect to a 
probability measure ir and has spectral gap 5 > 0. Let Ai,A2 be subsets of Q that are 
measurable with respect to J-< a and J-> a +t , respectively. Let P\ and P 2 be the corresponding 
projection operators on L 2 (S7,P), i.e., Pif{u>) = /(w)l^(u;) for every function f on fL 
Then 

\\p x p 2 \\ < ^p\M^p\M + e- st (i - ypj^ypra) , 

where the norm on the left is the operator norm for operators from L 2 (S7,P) into itself. 

Proof. Since -Pi and P 2 are self-adjoint and commuting PiP 2 is also self-adjoint. By the 
definition of norm, duality and the positivity of Pi, 

||PlP 2 || =sup{(PiP 2 /2,/i) : \\fih =hfi> 0} 
= sup{(P 2 / 2 ,Pi/ 1 ) : ||/i|| 2 = l,/i>0}. 

For /i, f 2 such that Pi/i / 0, P 2 f 2 + 0, define 

Pih ,/ P2f2 



J l ~ II D e\\ ' J 2 



I-P1/II2' II-P2/2II2' 
so H//H2 = 1 and supp(/j) C A%, for i = 1,2. Since ||Pj/i|| < 1 and since Pl,P 2 are 
idempotent, we have 

(P 2 / 2 ,Pi/i) < (f 2 J[) = (ft&Pifi). 

Therefore, 

||PiP 2 || = sup [E[hf 2 ] : \\fi\\ 2 = l,fi> 0, supp(/i) C Ai for i = 1,2}. 

Given f\ and f 2 as in the last equation, let g\ : 5 — > M be defined by gi(x) := 
E[/i(cj) I w(a) = x] and 52 (x) := E[/ 2 (w) | w(o + 1) = x\. Clearly, E[ 5i ] = E[/i] and E[<? 2 ] < 
E[/ 2 ] = 1. Since /1 is J< a measurable and / 2 is F> a +t measurable, by the Markov property 
it follows that 

E[/!/ 2 ] =E[E[/x/ 2 \Lj(a),u(a+t)}) =B[ gi (u(a)) g 2 (u(a + t))) =B[ gi T t g 2 }. 

Using the fact that the spectral gap of T% is 1 — e~ St , it follows from Lemma 12.21 below that 
the last expression is bounded by 

B[ gi }B[g 2 ] + e~ st (l - B[ gi ]B[g 2 }) = E[/i]E[/ 2 ] + e^\l - E[/i]E[/ 2 ]), 

and using E[/j] = E[/j 1 A ] < ||/i|| 2 ||l^ lb < a/P[AJ we obtain that 



iiPiP 2 ii < ypWv / pW+e^*(i- ypwv^w^ 

as stated. □ 
The proof of the previous lemma used the following easy fact. 
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Lemma 2.2. Let M be an ergodic transition matrix for a Markov chain on the set S which 
is reversible with respect to the probability measure tt and which has spectral gap 5 > 0. Let 
9ii92 '■ S — > M be two functions with L 2 norm at most one. Then 

EbiM 52 ] < E[ 9l ]E[g 2 ] + (1 - <5)(l - \E[ gi ]E[g 2 }\). 

Proof. Abbreviating 1 = I5, we set hi = g\ — E[gj]l. Then hi is orthogonal to the constant 
functions and 1 is a 1-eigenvector of M. Therefore, 

E[gxMg 2 \ = E[ gi )E[g 2 ] + E[hxMh 2 ]. 
Using the spectral gap of M, we get: 

E[h 1 Mh 2 ] < \\h42WMh2h < (1 - <5)||^i||2||fe||2 < (1 - S)^(l-E[ gi ] 2 )(l-E[g 2 } 2 ) 

<(1-5)(1-|EHEM|), 

where the last inequality follows from the inequality (1 — x 2 )(l — y 2 ) < (1 — xy) 2 , valid for 
all i,y£l, and we are done. □ 

Proof of Theorem 11.21 Let Pi denote the projection onto Ai, as in Lemma 12.11 It is 
easy to see that 

p[f]A]=E[i(flP i )i}=E[t(l[p?) a], 

i=0 i=0 i=0 

since the projection P, satisfies P 2 = Pi, and the order in which the projections act does 
not matter. These operators are also self-adjoint, so that 



fc-i 



E 



[1 ( n pf) 1] = e [(p fc i) p k ( n p f) p ° ( p oi) 



k-l 



i=0 



i=l 



^(11^)^ 



i=i 



fe-i 



< y/P[Ao]y/P[A k ] [J ||PP+l||, 
i=0 

where, in the rightmost expressions, the new notation denotes the operator norm from 
L 2 (0, P) to itself. By Lemma l2"7Tj we have that, for each i £ {0, . . . , k — 1}, 

\\PiPi+x\\ < v /P[AjVP[A + i]+e-^+ 1 - b ^ (1 - ypTWPlA^j 

Hence, 

fc-i fe-i 

n iiPiP i+ iii < n [x/pWVpia+i] + e-- 5 ^ 1 - 6 ') (1 - ypiAiypiA^] 

i=0 i=0 

and the proof is complete. □ 
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3 Examples concerning Theorem 11.1 



3.1 An example where Theorem 11.11 is sharp 

We give our example (of a reversible Markov chain) in terms of conductances on edges; see 
[LP11 j, Chapter 2] for an exposition of the relevant theory. Let (3 £ (1,2). Consider the 
graph with vertex set Z* = Z\{0} and edge set given by the nearest-neighbour edges among 
these vertices, plus the edge (—1, 1), and a self-loop on each vertex. Equip the edges with 
conductances c n ^ n+ \ = c_( re+1 y_ n := n~P for each n > 1, c_i ; i = c^i = c_i._i := 1/2, and 
c n ,n '■= c n ,n-i + c n ,n+i for \n\ > 2. Consider the discrete-time random walk Y on this graph 
equipped with this set of conductances. The sum of the conductances being finite, Y has 
a finite invariant measure tt, whose value at a vertex is the sum of the conductances over 
incident edges; this is the unique invariant measure given its total mass. 

Let P denote the law of Y run in stationarity. Set C = N + , hence = 1/2. We will 
show that 

P [Y s £ C Vs £ [0, t}} = t^r + °^ (3.1) 

and that 

p[Y ,Y t G c]-p[r GC] 2 < ip[y s e c Vs e [o,t]) ; (3.2) 

moreover, for any e > 0, and for all t > sufficiently high, 

P[Y ,Y t eC] -P[y GC] 2 > (£-e)P[y,€CVa€[0,t]]. (3.3) 

These show that Theorem 1 1.1 1 is basically sharp in the regime of polynomial decay; in fact, 
(|3.3p is a stronger bound than what follows from Theorem ll.il As we will see from the proof 
of (|3.2p and (|3.3|) . this is really an example where not leaving the set C at all is "responsible" 
for almost all of the correlation between {Yq £ C} and {Yt £ C}. 

We first prove (|3.ip . which might already be in the literature somewhere, but we could 
not locate a reference. We start with the lower bound. In essence, the bound holds because 
Y has probability of order t( 1- ^)/ 2 to begin at a site of order at least t 1//2 from the vertex 
1. From such a site on the positive half-line, the walk has positive probability to remain 
positive for t steps: indeed, at such sites the walk experiences an excess in leftward transition 
probability over rightward of order t -1 / 2 , so that, during t steps, this imbalance provides 
a drift towards the origin totalling an order of i 1 / 2 steps. This drift is thus comparable to 
the Gaussian fluctuation of the particle during this period, and the particle remains to the 
right of the origin with positive probability. Rather than make this heuristic rigorous, we 
prove the lower bound in (|3.ip by invoking the Carne-Varopoulos bound (see |Car851 IVar85j 
or |LP1H Theorem 13.4]) which, in the case of a reversible Markov chain X having finite 
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stationary measure (and hence spectral radius 1), asserts that 

for all x,y G S, s > 0; here, denotes graphical distance on S, and 7r denotes the 

stationary measure. We apply this bound to the walk Y. Noting that ^fej < Cx~@ 
for x G N, we find that, if s G {0, ...,*}, and x G N satisfies ^v^log t) 1 / 2 < x < 
t 1 /2^(l gt)V2 + t i/2 i 

P[y(s) = 1 | Y(0)=x] < C(t 1 /2 v ^(l ogt )V2 +t l/2^/2 exp |_ 21og ^ 

Sum this bound over s G {0, . . . , t} to arrive at 

P [3s G {0, . . . , t} : Y(s) = 1 | Y(0) = x] < Ct^ 4 ' 1 (v^log t) 1 / 2 + if' 2 . 

Recalling that f3 < 2, we find that, for t high enough, the conditional probability, given 
that Y(0) assumes any one of the t 1 ^ 2 values of x described above, that Y reaches 1 before 
time t, is at most one-half. The probability that V(0) assumes some such value is at least 

C (t 1 /2^(logt)V2 +t l/2)-ty/2 = ct (l-/3)/2(^2 (logt) l/2 + Xhug) 

P [Y (s) > 1 Vs G {0, . . . , t}] > f t^l 2 ( V2(log t) 1 / 2 + l)- f) , 

so that the lower bound in (|3.ip is verified. 

We turn to the upper bound in (13. ip . Let Z denote the Markov chain on Z which 
shares its initial distribution with Y and which evolves as a simple random walk. The two 
processes may be coupled so that, should Y(0) be positive, then Y(m) < Z{m) for all m at 
most the hitting time of 1 by Y; for this reason, it suffices to establish the upper bound in 
(|3.ip for the process Z. 

Let a, t G N. Let Z a : {0, . . . , t} — > N denote simple random walk with Z a (0) = a. By 
[LPW091 Theorem 2.17], P[3s G {0, . . . ,t} : Z a {s) = 0] < 12a*- 1 / 2 . Thus, 

P[Z(a) > 1 V < s < t | Z(0) = a] < 12at" 1/2 . (3.5) 

Multiplying the inequality resulting from (|3.5p by P [Z(0) = a] , we sum over a G N to 
obtain 

t l/2 

P [Z(s) >lV0<s<t] < 12 p = a ] a *~ 1/2 + p > * 1/2 ] • ( 3 - 6 ) 

a=l 

Note that there exists C > such that P[Z(0) = a] < C'a'' 3 for a G N; thus, /? G (1,2) 
implies that each of the two terms on the right-hand side of (|3.6p is at most a constant 
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multiple of t (1_/3)/2 . That is, the upper bound in (|3.ip holds for Z, as we sought to show. 
This completes the proof of (13.11) . 

We now show (|3.2p . Let U be the first time that Y makes the jump (—1, 1), and V be the 
first time that Y makes any of the three jumps (—1, 1), (1, 1), (— 1, —1). Obviously, U > V. 
The point of considering V is that c^i = ci^ = c-\-\ implies that ~P[Yy G C \ Y[o,y)] = 
1/2, and then the symmetry of the entire chain and the strong Markov property implies 
that P[Y t G C | V < t, Y [oy) ] = 1/2, as well. On the other hand, {Y s G C Vs G [0,t]} = 
{Y G C, 17 > t} 5 {Y e C, y > t}, and hence P[Y t G C \ Y G C, V > t] = 1. Therefore, 

p[y , y g c] = p[y g c, y < t] • p[y g c \ y g c, v < t] 
+ p [Yq g c, y > t] • p[y g c | y g c, v > t] 
= (p[y g c] -p[y g c, y >t\) ■ i/2 + p[y g c, v>t]-i (3,7) 

< ip[y EC] +^p[y GC, I7>t] . 

Using that P[y G C] = vr(C) = 1/2 for all t > 0, we get (|5^|> . 

By the second equality in (pTTj) . P[y ,y G C] = | + ^P [F G C, V > t] . The bound 
(|3.3p thus follows from the claim that for each e > 0, and for all t > sufficiently high, 

P[y G C, V > t) > (1 - e)P[y G C, [/ > t]. (3.8) 

The event on the right-hand side is simply {Y s > 1 Vs G [0, t]}; the event on the left-hand 
side contains the event {V s > 2 Vs G [0, t]}. Hence, it is enough to argue that the conditional 
probability of the latter event given the former tends to one in a limit of high t. The event 
that Y s > 1 for all s G [0, t] and Y s = 1 for some such s entails either that Y remains positive 
for time t/2 after first reaching 1 after time 0, or that the same holds for the reversed chain, 
with time running backwards from t; either of these events has probability at most Ct~ l l 2 
by (|3.5|) applied for a = 1. However, the event Y s > 1 Vs G [0,t] has probability at least 
c ^( 1 -/ 3 )/ 2 +°( 1 ) by the lower bound in (|3.ip ; this is much more probable under our hypothesis 
that (3 < 2, so that, given that Z is strictly positive on {0, . . . ,t}, the conditional probability 
that Z visits 1 during this interval tends to zero in high t. In this way, we obtain (|3.8p and 

thus m . 

3.2 Some cases of non-sharpness 

We first give an example where Theorem II. II is not at all sharp. Consider the process Y as 
above, and take C to be the set of even positive integers. The conductances on the self-loops 
are set in such a way that P[y+i G 2Z | Yj] = 1/2, regardless of y. Therefore, 7r(2Z) = 1/2 
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and 7r(C) = 1/4, and, using (|3.2p and (|3.3|) . then (|3.ip . we find that, for t > 1, 

p[y t g c | y G c] = ^Pfc-i > o I y > o] 

= - (i/2 + g(i)p[y, > o vo < s < t- 1 1 y > o]) 

4 

Hence the correlation is polynomially large. On the other hand, P \Y S G C VO < s < f] is 
clearly exponentially small. 

A more complicated but more natural example is given by Corollary 14.11 below. 

The previous subsection showed that Theorem 11.11 is sharp in the regime of polynomial 
decay. Examining the proof of the theorem, we have the feeling that this is not the case in 
the regime of superpolynomial decay. In particular, we have the following question. 

Question 3.1. Does the exponential pairwise decorrelation d{t) = Cexp(— ct) in U.ty) for 
some event C imply an exit time exponential decay P[u s E C Vs G [0, t]l < C"exp(— dt), 
with c' , C depending only on c, C and P[C] = p? 

4 Applications to dynamical percolation 

Critical planar percolation is a central object of probability theory and statistical me- 
chanics; see |Gri991 IWer09] for background. The best understood example is Bernoulli(l/2) 
site percolation on the triangular lattice, where the existence of a conformally invariant 
scaling limit is known. Roughly, if we consider percolation on the lattice of mesh 1/n, 
and any collection Q±, . . . , Qk of conformal images of rectangles, then the joint distribution 
of the left-right crossing events inside these Qi's has a limit that is conformally invariant. 
Moreover, one can define a continuum random limit object encoding all the macroscopic 
crossing events. See [SSmGll] and the explanations and references there. In dynamical 
percolation, every site is switching between being open and closed according to an in- 
dependent exponential clock, in such a way that the stationary distribution on {0, l} Vn is 
critical percolation, where V n is the set of sites, and represents "closed" and 1 represents 
"open" . This model has been studied from three closely related points of view: 

(1) How long does it take to change macroscopic crossings? Or, how noise sensitive 
are the crossing events? A reasonable guess is that this time-scale is given by the 
expected number of pivotal switches in the unit square (i.e., changes of the left- 
right crossing event) being of order one. Let Piv(n) be the expected number of sites 
in critical percolation in the unit square with mesh size 1/n that are pivotal for 
the left-right crossing; it is known for the triangular lattice that Piv(n) = n 3//4+o( - 1 - ) 
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[SWOT]. Then, in the stationary process, using Fubini's theorem and the linearity 
of expectation, the above time-scale is simply n 2 /P\v(n) = n 5 / 4+ °^ 1 \ This guess, 
based merely on the expectation, has been confirmed by [BKS991 ISStlCH [GPSlOaj : if 
tn 2 /Piv(n) = tn 5 / 4 " 1 "^ 1 ) sites are resampled in the unit square, then the correlation 
of crossing before and after the resampling is i~ 2 / 3 , up to constant factors, as t — > oo 
|GPS10al Eq. (8.7)]. Similar, though slightly weaker, results have been proved for 
general conformal rectangles Q with piecewise smooth boundary. Furthermore, even 
for critical (i.e., Bernoulli(l/2)) bond percolation on Z 2 , where the existence of critical 
exponents such as that describing the growth of Piv(n) are not known, it follows 
from the proof of |GPS10a] Corollary 1.2], together with Eq. (2.6) there, that, after 
resampling tn 2 /Piv(n) edges, the correlation is at most 0(t~ a ) for some a > 0. 

(2) On an infinite lattice, are there random times with exceptional behavior, e.g., with 
an infinite cluster? In other words, which events are dynamically sensitive? It 
was proved in [SStlOj that there are exceptional times with an infinite cluster, and 
in [GPSlOa] that their Hausdorff dimension is almost surely 31/36, and that such 
exceptional times also exist for critical dynamical bond percolation on 1? . A natural 
law on the infinite cluster that appears at exceptional times will be introduced and 
studied in |HmPS12j . 

(3) In the unit square (or in another conformal rectangle), with mesh 1/n and a well- 
chosen rate r(n) for the exponential clocks, is there a scaling limit of the process, 
giving a Markov process on continuum configurations? If we choose r(n) = 1/Piv(n) = 
n -3/4+o(i)^ then the expected number of pivotal switches in the unit square during 
a unit time will be exactly 1, independently of n. It is proved in |GPS10b"l IGPSj 
that, with this scaling, such a scaling limit does indeed exist on the triangular lattice. 



Additionally, it follows from the results of |GPS10a] . item (1) above, that the resulting 
Markov chain is ergodic; in particular, the correlation decay for the unit square, in 
rescaled large time t, is t -2 / 3 , up to constant factors. 



We will have an application of Theorem 11.11 to the setup of items (1) and |(3)[ and an 



application of Theorem 11.21 to the setup of item (2) Here is the first of these results: 

Corollary 4.1. In dynamical critical site percolation on the triangular lattice or bond perco- 
lation on 1? , with mesh 1/n and rate 1/Piv(n) for the clocks, consider the left-right crossing 
event C in the unit square. There exist constants {Ck '■ K G N} such that, for each K G N 
and for all t > and n G N, 

(1/4) r»i < p[uj s e cvs g [o,t\] < c K r K . 

On the triangular lattice, it is known that Piv(n) = n 3 / 4 " 1 "^ 1 ), and the above bounds in t also 
hold for the scaling limit of dynamical percolation. 
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Before starting the proof, let us emphasize that this corollary concerns a natural question 
that has exactly the kind of setup for which Theorem 1 1.1 1 is designed. Namely, in the finite 
n version with the discrete-time chain (with sites being resampled one- by-one), the mixing 
time of the entire chain is n 2+0 ^ steps (it is just random walk on an n 2 -dimensional 
hypercube), while the left-right crossing event C decorrelates on the scale n 5 / 4- ^ 1 ), as 
mentioned in item (1) In the scaling limit of the chain, only the evolution of macroscopic 
crossing events is considered, so that 

n 5/4+o(l) ig 

the natural scaling factor needed to obtain 
this scaling limit. In particular, we are interested in the tail probability of exiting C on this 
time scale, a question for which analysis based on the spectral gap of the entire chain would 
clearly be too crude. Moreover, it turns out that the limit chain does not have a spectral 
gap (something which is clear from the polynomial decorrelation t~ 2//3 ), hence the classical 
exponential exit time results |AKSz87] do not apply. One may nevertheless hope that at 
least there would be a "spectral gap restricted to C" , i.e., (T^gTLc) > < (1 — c)(g,g) 
for some c > 0, for all g G L 2 ({0, l} Vn , P1/2)) which, similarly to the proof of Theorem 11.21 
would imply an exponentially small upper bound in Corollary 14.11 However, it is not hard 
to prove that g = Ijdensity of open bits is > 1/2 + n -3 / 4+e }, with e > fixed but small 
enough, is a counterexample. There are slightly more complicated counterexamples that 
make sense also in the scaling limit. 

It is also interesting to note that Corollary 14. 1\ despite being a consequence of The- 
orem II. H provides a natural example in which the theorem by itself is not sharp: the 
correlation decay is polynomial, while the exit time tail is super polynomial. 

Proof of Corollary 14.11 We will work in the discrete lattice setting, i.e., with a fixed 
finite n € Z + . All our results will hold uniformly in n, so that item |(3)| above implies that 
the results extend to the continuum scaling limit. 

Firstly the upper bound. Let L E N, and decompose the unit square into L vertical slabs 
with dimensions 1/L x 1 (the induced subgraphs in the slabs will not be exactly isomorphic 
to each other, but this is not a problem). For s > and i E {1, . . . L}, let Ai(s) denote 
the event that the i th such slab has an open left-right crossing at time s. We further write 
Ai(t) for the intersection of the events Ai(s) over all s G [0,t]. Clearly, 

L 

{oo s ECVsE [0,t]} C P| Ai. (4.1) 

i=l 

We may apply Theorem II. II to bound P [Ai\ . To do so, we need to have a correlation decay 
between the events Ai(0) and Ai(t). Indeed, 

P [Ai (0) , Ai (t )] - P [ Ai (0)] 2 < C' L t~ a 

holds for all i > and some constant C' L , with a = 2/3 in the case of the triangular lattice. 



This is simply the analogue for the slab of the decorrelation bound mentioned in item (F 
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above; if one does not want to optimize the constant C' L , then the proof is identical to 
the one for the square case in [GPSlOal Corollary 1.2]. Noting that the variance of the 
left-right crossing event in any one of the slabs is an L-dependent constant, Theorem 11.11 
may be applied and yields that 

P[Ai(t)] < i- Q+o(1) , (4.2) 

for each i £ {1, . . . , L}, where the o(l) term depends on L. We now use (|4.ip . (j4.2j) and the 
independence of dynamical percolation in disjoint regions to find that 

p[u s £CVs£ [o,t]] <r aL+o{ - l \ 

and the upper bound follows. 

For the lower bound, we need a basic tool that we call the dynamical FKG inequality. 
The next lemma is not the strongest possible form of such a result, but it will suffice for 
our purposes. Firstly, we need some notation. 

A realization of dynamical percolation on a finite graph G may be interpreted as a map 
u) : M. — > {0, Let S denote the space of such maps; we will write u s (x) (for s 6 I 
and x £ V{G)) for the value at time s of lo in bit x. For static configurations, that is, 
elements rj & {0, 1} V ( G \ we consider the natural component-wise partial order: r\ -< rj' iff 
rj(x) < i]'(x) for all x £ V(G). We extend this to a partial order on S by writing lo ^ lo' iff 
u s ^ w' s for all s £R. A function / : S — > K is called increasing if f(uj) < f(Lo') whenever 
oj ^ u'. An event C C S is called increasing if TLq is increasing. 

The standard Harris-FKG inequality for percolation (see |Gri991 Theorem 2.4]) says 
that increasing functions of static configurations are positively correlated. In particular, 
conditioning on an increasing static event makes the percolation configuration "larger". 
Our dynamical FKG inequality deals with conditioning on an increasing dynamical event: 

Lemma 4.2. Let C C S be an increasing event. Let P denote the law of dynamical 
percolation on [0, oo). Then there exists a coupling Q of the laws P and P[- Cl such that, 
denoting the two marginals by lo and io c , we have that Q[wo ^ ] = I- 

Assuming this lemma, if A(t) is the event that the square is crossed for all s G [0, t], 
then, for k £ N, 

P [A((k + l)/2)] > P [A(k/2)] P [.4(1/2)] . (4.3) 

Indeed, Lemma 14.21 implies that, conditionally on A{k/2), the distribution of the marginal 
of dynamical percolation at time k/2 stochastically dominates critical percolation. The 
event A{k/2) being conditionally independent of the subsequent evolution of dynamical 
percolation given the configuration at time k/2, we see that, conditionally on A(k/2), 
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the distribution of dynamical percolation on [k/2,(k + l)/2] stochastically dominates its 
unconditioned counterpart, whence (14. 3j) . 

We claim now that P [.4(1/2)] > 1/4. Indeed, the expected number of pivotal switches 
during a duration of one-half of scaled time is 1/2, by Fubini's theorem, and, by symmetry, 
this remains the case conditionally on there being a left-right crossing at the start of this 
duration. So, by Markov's inequality, the conditional probability of having no pivotal switch 
during this time is at least 1/2. The probability of a left-right crossing being 1/2, we find 
that P [-4(1/2)] > 1/4. Thus, iterating (|43|) gives the lower bound in Corollary gTj □ 

We still owe the proof of the dynamical FKG lemma that we used: 

Proof of Lemma 14.21 Note that the law of uj may be constructed as follows. To 
each x G V(G), we associate a sequence ej(x), i = 1,2,... of independent exponential 
mean one random variables, and an independent sequence bi(x), % = 1,2, ... of independent 
Bernoulli random variables. We set ojq to be a uniform element in {0, 1} V ^ G \ We further 
set ujt(x) = bi{x) where i G N + is minimal subject to Yl)=i e j( x ) ^ W no such i exists, 
then t < e\{x)\ in this case, we set uit(x) = uio(x). 

Write J7 + denote the data ei(x) and bi(x) for i G N + and x G V(G). Note that $7 + 
and coq comprise all of the data that specifies to. As such, we may denote an instance uj of 
dynamical percolation on [0, oo) in the form (ujq,uj + ) G {0, x £l + . 

Suppose given an element oj + G VL + . Note that, if u)q,u}' G {0, 1}^( G ) satisfy ljq ^ u>' , 
then (loq,u + ) G C implies that (wq,o; + ) G C. Hence, by the Harris-FKG inequality for a 
static configuration, the distribution of a;o under P[- | C, w + ] stochastically dominates its 
distribution under P[- | w + ] (which is the uniform distribution). 

Let nc,+ denote the conditional distribution of ui + given C. Then 

p [" I °] = / p [" I C ^ + ] dVcA" + )- (4.4) 

We have shown that, for all choices of uj + , the conditional distribution of ujq given C and 
u + stochastically dominates the uniform distribution on {0, 1} V ( G \ This statement remains 
true after the averaging in (|4.4p . Hence, we find that the law of uq given C stochastically 
dominates its unconditioned law. □ 



Question 4.3. What is the true decay of the probability for having a left-right crossing of 
the unit square during [0, t] in the scaling limit of dynamical percolation? We expect it to 
be exp(— tP+°0-)), with j3 G (0,1). 

We now move to the application of Theorem [L2] to the study of exceptional times, which 



is item (2) in the list at the start of Section HI Consider critical dynamical site percolation 
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(uJt)t>o ° n the infinite triangular lattice or bond percolation on Z 2 (no scaling of space or 
time), and let 

£ := {t G [0, oo) : uj t has < — > 00} 

be the set of exceptional times when the cluster of the origin is infinite. For any fixed time 
t, we have P[i £ £] = 0; hence £ has zero Lebesgue measure almost surely. However, as 



claimed in item (2), it is almost surely nonempty, with Hausdorff dimension 31/36 in the 
case of the triangular lattice. A natural question is how long one has to wait to see the first 
exceptional time. It is answered by the following corollary which will also be an important 
tool for |HmPS12] in studying the infinite clusters that appear in £. 

Corollary 4.4. There exist 00 > c\ > C2 > such that, for critical dynamical site percola- 
tion on the infinite triangular lattice or bond percolation on 1? , 

exp(-ci t) <P[£n [0, t] = 0] < exp(-c 2 t) . 

In other words, the first exceptional time FET := min£ has an exponential tail. Note 
here that, by |HgPS97 Lemma 3.2], the set £ is topologically closed, hence the minimum 



makes sense. Furthermore, ^(coo) is almost surely finite, hence it takes positive time until 
a bit in its boundary d'&^o) first changes its status; thus FET > almost surely. 

Naturally, the proof of the corollary will go through the finite approximations FET# : = 
inf {i G [0,oo) : uj t has < — > (9-Br(0)}. But first of all we need to prove that these times 
are actually approximations to FET: 

Lemma 4.5. We have that FETj? — > FET almost surely as R — >■ 00. 



For the proof, we will need another result proved in HgPS97, Lemma 3.2]: 



Lemma 4.6. Almost surely, the set £ of exceptional times is disjoint from the set of times 
at which the status of a site is updated. 



Proof of Lemma 14.51 By item (2) above, £ n (0,oo) ^ 0, thus FET < 00, almost surely. 
The sequence FET# is increasing and bounded above by FET; thus, there exists some 
random r G (0, 00) such that FET/j r. Assume that r < FET with positive probability, 
which is to say that the cluster ^o( w r) of the origin in u T is finite with positive probability. 
Almost surely, the set of flip times for any given site is a locally finite subset of M, and the 
same holds if we take the union of the flip times over the finite set S := ^o( w t) U (9^o( w r)- 
Therefore, on the event |^o( w t)| < 00, there exists some random e > such that the interval 
(t — e,r + e) either has no flip times for S (and hence the set ^o( w t) remains unchanged), 
or it has a single flip time, r. On the other hand, we know that limsup s ^ T ^(^s)! = 00 
almost surely, which is consistent with the above only if r is a flip time for a site in 8^0(00^, 
closing at exactly time r, and if the reopening of this site creates a configuration in which 
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< — > oo. However, Lemma 14.61 shows that this circumstance has zero probability to occur 
at any time. We conclude that r = FET almost surely, which completes the proof. □ 

Proof of Corollary 14.41 Lemma 14.51 shows that the events {£r n [0, t] = 0} increase to 
the event {£ n [0, t] = 0}. This means that bounds for £r that are uniform in R imply the 
same bounds for £. 

The lower bound is given simply by the probability that the bits (sites or bonds) neigh- 
bouring are closed during [0, t\. 

For the upper bound, in order to apply Theorem 11.21 we need the well-known fact that 
our Markov process has a spectral gap that is uniform in R (it is just continuous time 
random walk on the hypercube {0, l} Bfl (°), with unit rates on the edges), and also need 
that P [S R n [0, 1] / 0] > c> 0, uniformly in R. For the case of the triangular lattice, this is 
part of [SStlOl Theorem 1.3], while, for the case of Z 2 , part of [GPSlOal Theorem 1.5]. □ 

To conclude, let us point out the following interesting phenomenon. Although the results 
of [GPSlOa] behave well under the n vs. 1/Piv(n) = n~ 3//4+ °( 1 ^ space-time scaling, and hence 
it is not hard to show that, in the scaling limit of dynamical percolation in the full plane 
(mentioned in item |(3)| above), exceptional times when the ball of radius 1 is connected to 
infinity do exist and have Hausdorff dimension 31/36 a.s., the tail for the first exceptional 
time is expected to behave differently in the scaling limit than in the discrete case. If 
we try to think of the scaling limit process as unit-order regions flipping between being 
well-connected and not-at-all-connected (analogues of being open and closed in the discrete 
process) at a roughly unit rate, then it seems reasonable that, similarly to the discrete case, 
the tail behaviour of not having the connection from radius 1 to infinity is comparable to 
the obvious lower bound, the tail for not having a connection across the annulus between 
radii 1 and 2. However, we expect this annulus-crossing tail to be subexponential (see 
Question 14. 3p . which would give a subexponential lower bound also here. The same issue 
from a different viewpoint is that we do not have any more the spectral gap that we needed 
in order to apply Theorem 11.21 hence there is no reason to hope for an exponential tail. 
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